Recent advances in leveraging human guidance for sequential decision-making tasks
نویسندگان
چکیده
A longstanding goal of artificial intelligence is to create agents capable learning perform tasks that require sequential decision making. Importantly, while it the agent learns and acts, still up humans specify particular task be performed. Classical task-specification approaches typically involve providing stationary reward functions or explicit demonstrations desired tasks. However, there has recently been a great deal research energy invested in exploring alternative ways which may guide may, e.g., more suitable for certain less human effort. This survey provides high-level overview five recent machine frameworks primarily rely on guidance apart from pre-specified conventional, step-by-step action demonstrations. We review motivation, assumptions, implementation each framework, we discuss possible future directions.
منابع مشابه
Group Decision-Making Models for Sequential Tasks
The sequential probability ratio test (SPRT) and related drift-diffusion model (DDM) are optimal for choosing between two hypotheses using the minimal (average) number of samples and relevant for modeling the decision-making process in human observers. This work extends these models to group decision making. Previous works have focused almost exclusively on group accuracy; here, we explicitly a...
متن کاملDecision-Making in Research Tasks with Sequential Testing
BACKGROUND In a recent controversial essay, published by JPA Ioannidis in PLoS Medicine, it has been argued that in some research fields, most of the published findings are false. Based on theoretical reasoning it can be shown that small effect sizes, error-prone tests, low priors of the tested hypotheses and biases in the evaluation and publication of research findings increase the fraction of...
متن کاملChallenges for Communication Decision-Making in Sequential Human-Robot Collaborative Tasks
Effective communication between teammates is critical to the success of collaboration, including human-robot collaboration. For enabling human robot communication, several modalities are actively being researched — such as, text, speech, visual signals, and legible motion. The design of these modalities is necessary to achieve effective communication; however, it is not sufficient. Communicatio...
متن کاملConvergence in a sequential two stages decision making process
We analyze a sequential decision making process, in which at each stepthe decision is made in two stages. In the rst stage a partially optimalaction is chosen, which allows the decision maker to learn how to improveit under the new environment. We show how inertia (cost of changing)may lead the process to converge to a routine where no further changesare made. We illustrate our scheme with some...
متن کاملStructure Learning in Human Sequential Decision-Making
Studies of sequential decision-making in humans frequently find suboptimal performance relative to an ideal actor that has perfect knowledge of the model of how rewards and events are generated in the environment. Rather than being suboptimal, we argue that the learning problem humans face is more complex, in that it also involves learning the structure of reward generation in the environment. ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Autonomous Agents and Multi-Agent Systems
سال: 2021
ISSN: ['1387-2532', '1573-7454']
DOI: https://doi.org/10.1007/s10458-021-09514-w